NSF PAR Search | NSF Public Access Repository

Generating Sequences by Learning to Self-Correct

Welleck, Sean; Lu, Ximing; West, Peter; Brahman, Faeze; Shen, Tianxiao; Khashabi, Daniel; Choi, Yejin (July 2023, The Eleventh International Conference on Learning Representations)

Sequence generation applications require satisfying semantic constraints, such as ensuring that programs are correct, using certain keywords, or avoiding undesirable content. Language models, whether fine-tuned or prompted with few-shot demonstrations, frequently violate these constraints, and lack a mechanism to iteratively revise their outputs. Moreover, some powerful language models are of extreme scale or inaccessible, making it inefficient, if not infeasible, to update their parameters for task-specific adaptation. We present Self-Correction, an approach that decouples an imperfect base generator (an off-the-shelf language model or supervised sequence-to-sequence model) from a separate corrector that learns to iteratively correct imperfect generations. To train the corrector, we propose an online training procedure that can use either scalar or natural language feedback on intermediate imperfect generations. We show that Self-Correction improves upon the base generator in three diverse generation tasks - mathematical program synthesis, lexically-constrained generation, and toxicity control - even when the corrector is much smaller than the base generator.
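The generate-then-correct loop described in this abstract can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: `self_correct`, `toy_generator`, `toy_corrector`, and `missing_keywords` are hypothetical names, the "models" are hand-written stand-ins rather than learned networks, and the task is a toy version of lexically-constrained generation in which feedback is the list of required keywords still missing.

```python
from typing import Callable, List


def self_correct(
    x: List[str],
    generator: Callable[[List[str]], str],
    corrector: Callable[[List[str], str, List[str]], str],
    feedback: Callable[[List[str], str], List[str]],
    max_steps: int = 3,
) -> List[str]:
    """Draft with a fixed generator, then revise with a separate corrector."""
    y = generator(x)              # the base generator is never updated
    history = [y]
    for _ in range(max_steps):
        fb = feedback(x, y)       # feedback on the current imperfect draft
        if not fb:                # no violations left: stop revising
            break
        y = corrector(x, y, fb)   # the corrector proposes a revision
        history.append(y)
    return history


# Hypothetical stand-ins for the learned components (assumptions).
def toy_generator(keywords: List[str]) -> str:
    return "the dog runs"


def missing_keywords(keywords: List[str], text: str) -> List[str]:
    words = text.split()
    return [k for k in keywords if k not in words]


def toy_corrector(keywords: List[str], text: str, fb: List[str]) -> str:
    # Repairs one violation per step by appending the first missing keyword.
    return text + " " + fb[0]


history = self_correct(
    ["dog", "park", "ball"], toy_generator, toy_corrector, missing_keywords
)
```

In this toy run the draft improves by one keyword per revision, which mirrors the iterative correction idea while leaving the generator untouched.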
Full Text Available
Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering

https://doi.org/10.18653/v1/2022.emnlp-main.611

Liu, Jiacheng; Hallinan, Skyler; Lu, Ximing; He, Pengfei; Welleck, Sean; Hajishirzi, Hannaneh; Choi, Yejin (January 2022, EMNLP)

Full Text Available
DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts

https://doi.org/10.18653/v1/2021.acl-long.522

Liu, Alisa; Sap, Maarten; Lu, Ximing; Swayamdipta, Swabha; Bhagavatula, Chandra; Smith, Noah A.; Choi, Yejin (January 2021, Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers))

Despite recent advances in natural language generation, it remains challenging to control attributes of generated text. We propose DExperts: Decoding-time Experts, a decoding-time method for controlled text generation that combines a pretrained language model with “expert” LMs and/or “anti-expert” LMs in a product of experts. Intuitively, under the ensemble, tokens only get high probability if they are considered likely by the experts, and unlikely by the anti-experts. We apply DExperts to language detoxification and sentiment-controlled generation, where we outperform existing controllable generation methods on both automatic and human evaluations. Moreover, because DExperts operates only on the output of the pretrained LM, it is effective with (anti-)experts of smaller size, including when operating on GPT-3. Our work highlights the promise of tuning small LMs on text with (un)desirable attributes for efficient decoding-time steering.
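One way to picture the product-of-experts combination is in logit space, where multiplying and dividing probabilities becomes adding and subtracting logits. The sketch below is illustrative only: the function names, the three-word vocabulary, the logit values, and the steering weight `alpha` are assumptions chosen for the example, not values taken from the abstract.

```python
import math
from typing import List


def softmax(logits: List[float]) -> List[float]:
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def combine(
    base: List[float],
    expert: List[float],
    anti: List[float],
    alpha: float = 1.0,
) -> List[float]:
    """Next-token distribution proportional to P_base * (P_expert / P_anti) ** alpha."""
    steered = [b + alpha * (e - a) for b, e, a in zip(base, expert, anti)]
    return softmax(steered)


# Hypothetical next-token logits over a toy vocabulary (assumed values).
vocab = ["great", "okay", "awful"]
base_logits = [1.0, 1.0, 1.0]    # base LM has no preference
expert_logits = [2.0, 1.0, 0.0]  # expert favors the desirable token
anti_logits = [0.0, 1.0, 2.0]    # anti-expert favors the undesirable token

probs = combine(base_logits, expert_logits, anti_logits)
unsteered = combine(base_logits, expert_logits, anti_logits, alpha=0.0)
```

With `alpha=0.0` the base distribution is returned unchanged; larger values push probability toward tokens the expert prefers and the anti-expert disprefers. Only the output logits of each model are needed, which is consistent with the abstract's point that the method can steer a large LM using smaller (anti-)experts.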
Full Text Available
On-the-Fly Attention Modulation for Neural Generation

https://doi.org/10.18653/v1/2021.findings-acl.107

Dong, Yue; Bhagavatula, Chandra; Lu, Ximing; Hwang, Jena D.; Bosselut, Antoine; Cheung, Jackie Chi; Choi, Yejin (January 2021, Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021)

Despite considerable advancements with deep neural language models (LMs), neural text generation still suffers from degeneration: the generated text is repetitive, generic, self-contradictory, and often lacks commonsense. Our analyses on sentence-level attention patterns in LMs reveal that neural degeneration may be associated with insufficient learning of task-specific characteristics by the attention mechanism. This finding motivates on-the-fly attention modulation, a simple but effective method that enables the injection of priors into attention computation during inference. Automatic and human evaluation results on three text generation benchmarks demonstrate that attention modulation helps LMs generate text with enhanced fluency, creativity, and commonsense reasoning, in addition to significantly reducing sentence-level repetition.
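The abstract does not say how the prior enters the attention computation, so the following is only one plausible reading, stated as an assumption: attention weights are computed as usual, multiplied elementwise by a non-negative prior, and renormalized at inference time. The function name `modulated_attention` and all numeric values are hypothetical.

```python
import math
from typing import List


def softmax(scores: List[float]) -> List[float]:
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def modulated_attention(scores: List[float], prior: List[float]) -> List[float]:
    """Reweight softmax attention by a prior and renormalize (assumed scheme)."""
    weights = softmax(scores)
    reweighted = [w * p for w, p in zip(weights, prior)]
    total = sum(reweighted)
    return [r / total for r in reweighted]


# One query attending over three positions with equal raw scores.
scores = [0.0, 0.0, 0.0]
# Hypothetical prior that doubles the emphasis on the last position.
prior = [1.0, 1.0, 2.0]

attn = modulated_attention(scores, prior)  # approximately [0.25, 0.25, 0.5]
```

Because the reweighting happens after the model has produced its scores, no parameters change, which matches the abstract's description of modulation applied during inference.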
Full Text Available
